Processing of noisy speech using partial phase

Authors

Bayya Yegnanarayana

K. V. Madhu Murthy

Hema A. Murthy

Abstract

This paper explores the possibility of processing noisy speech using signal reconstruction algorithms from Fourier Transform (FT) phase and magnitude. Algorithms have been proposed in the literature for signal reconstruction from FT phase alone, or from FT magnitude with additional information in the form of 1-bit phase or signal values. More recently, algorithms have been proposed for signal reconstruction from partial phase (phase information in selected frequency bands) with a compensating number of signal samples. In this paper we examine the application of these techniques to processing noisy speech. In particular, we show that by selectively processing high signal-to-noise ratio (SNR) regions we can significantly reduce the effect of additive background noise.
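As a rough illustration of what reconstruction from FT phase alone can look like, the sketch below uses the well-known linear-equation formulation: for a real finite-length sequence x[n] with FT phase φ(ω), the sum over n of x[n]·sin(ωn + φ(ω)) is zero at every frequency, so fixing x[0] = 1 leaves a linear system in the remaining samples. This is not taken from the paper and is not claimed to be the authors' algorithm; the function name `reconstruct_from_phase`, the test sequence, and the frequency grid are all assumptions made for the example. The sequence is chosen so that its z-transform has no zeros on the unit circle, a condition under which phase is known to determine the signal up to a scale factor.

```python
import numpy as np

def reconstruct_from_phase(phase, omegas, n_samples):
    """Recover a real sequence x[0..n_samples-1], scaled so that x[0] = 1,
    from its FT phase sampled at the frequencies `omegas` (radians/sample).
    Hypothetical helper written for this illustration only."""
    n = np.arange(1, n_samples)
    # Row k holds sin(omega_k * n + phase_k) for n = 1 .. n_samples-1.
    A = np.sin(np.outer(omegas, n) + phase[:, None])
    # The n = 0 term, with x[0] fixed to 1, moves to the right-hand side.
    b = -np.sin(phase)
    tail, *_ = np.linalg.lstsq(A, b, rcond=None)
    return np.concatenate(([1.0], tail))

# Assumed test sequence: the coefficients after x[0] sum to less than 1 in
# absolute value, so all zeros of its z-transform lie inside the unit circle.
x = np.array([1.0, 0.4, -0.2, 0.15, 0.1, -0.05])
N = len(x)

# Sample the phase at M distinct frequencies strictly between 0 and pi.
M = 4 * N
omegas = np.pi * np.arange(1, M + 1) / (M + 1)
X = np.exp(-1j * np.outer(omegas, np.arange(N))) @ x
phase = np.angle(X)

x_hat = reconstruct_from_phase(phase, omegas, N)
# Maximum deviation from the scaled original; small if recovery succeeded.
print(np.max(np.abs(x_hat - x / x[0])))
```

In the same formulation, restricting `omegas` to selected bands removes equations, and known signal samples can be moved to the right-hand side to compensate, which is one way to picture the partial-phase setting the abstract describes.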


Similar resources

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...


Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper, to achieve a satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...


Waveform estimation using group delay processing

A method of signal waveform estimation from an ensemble of jittered noisy measurements is presented. The method uses group delay functions to perform the ensemble averaging and thus overcomes the difficulty of computing the unwrapped phase function before averaging. We propose a new technique, called group delay processing, to estimate the signal waveform if only a single noisy measurement is a...


Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...


DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification

The importance of the phase information of speech signal is gathering attention. Many researches indicate system combination of the amplitude and phase features is effective for improving speaker recognition performance under noisy environments. On the other hand, speech enhancement approach is taken usually to reduce the influence of noises. However, this approach only enhances the amplitude s...



Publication date: 1987
